ED 283 874 



TH 870 413 



AUTHOR 
TITLE 

PUB DATE 
NOTE 



PUB TYPE 



EDRS PRICE 
DESCRIPTORS 



IDENTIFIERS 



Ola jnik ^ Staphan ; Algina , Jamas 

Bootstrap Estimation and Testing for Varianca 

Equality. 

Apr 87 

29p.| Paper presented at tha Annual Heating of the 
Ameriean Educational Rasaarch Association 
(Washington, DC, April 20-24, 1987). 
Spaaches/Conierence Papers (150) ~ Raports ^ 
Researah/Teahnical (143 ) 

iar01/PC02 Plus Postaga. 

Comparative Analysisi Estimation (Mathamatics } ; 
Maasurement Techniques ; ^Monta Carlo Mathods; 
*i*opulation Distribution I Sampla Sizei ^Sampling; 
^Statistical Distributionsi ^Statistical 
Significaneai Statistical Studies 
^Bootstrap Mathods i Chi Square Testi 7 Ratio; 
Population Validity; Statistical Analysis System; 
Typa I Errors; ^Variance (Statistical) 



ABSTRACT 

The purpose of this study was to davelop a s?.ngla 
procadura for comparing population variances which could be used for 
distribution forms* Bootstrap roathodology was used to estimate tha 
variability of the sampla varianca statistic when %hm papulation 
distribution was normal, platykurtic and laptokurtic^ The data for 
the study were genaratad and analyzad using the Statistical Analysis 
System computing packaga, Tha bootstrap estimates of variability 
underestimated tha thaoretical variance value, and tha maan sguare 
error of tha astimatad variance was small for both tha normal and 
platykurtic distributions^ but large for tha laptokurtic 
distribution. The F-ratio and chi-sguara tast statistics were 
computad for comparing the variability oi two populations using the 
bootstrap estimates of varianca, Obsarvad Type I error ratas within 
two standard errors o£ tha nominal significance level were obtainad 
only when the population distribution was platykurtic, Tha study 
concluded that in tha case of sample variance the bootstrap 
methodology could provide some indication of sampla accuracy but the 
degraa of accuracy would depend on the distribution form^ (JA2) 
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ABSTRACX 



Bootstrap methodolofy was used to estlinate the 
variability of the sample variance, statlatlc when the 
population distribution was nor ma 1, platykurtlc and 
leptpkurtlc. The booCstrap estimates of variability 
underestimated the theoretical variance value and the mean 
square error of the estimated variance was small for both 
the normal and platykurtlc distributions but large for the 
leptokurtic distribution* The F--ratio and chl-square test 
statistics were compiited for comparing the variability of 
two populationa using the bootstrap estimates of variance, 
Observered Type I error rates within two standard errors of 
the nominal significance level were obtained only when the 
population distribution was nlatykurtic. 
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Bootstrap Estimation and Testing for Variance Equality 

Developing statistical procedureii for comparing two or 
more population variances has been an Interest among 
statisticians for many years* The first procedures 
developed (e*g. Bartlett, 1937| Cochran, 1941| Hartley, 
1950) were based on the likelihood ratio test* These 
procedures all assume that the sampled population 
distributions are normal* When the distributions are 
non-normal the actual Type I error rate can be greater than 
or less than the nominal significance level depending on 
whether the distributions are platykurtlc or leptokurtlc 
respectively (Box, 1953), Since in practice the population 
dlstrlbutlnn form Is generally unknown and often non-normal, 
these approaches are of limited value. Procedures were then 
developed which attempted to modify the likelihood ratio 
tests by adjusting for the kurtosls of the distribution (e*g 
Box, 19531 Scheffe, 1959; Layard , 1973), These procedures 
estimate the populatfon kurtosls using sample data. But 
kurtosls estimated on a sample is not a very .cable 

tatlstlc and these tests typically have actual Type 1 error 
rates which devlite considerably from the nominal 

Ignlflcance level, A third approach taken for testing 
variance equality has been the use of n on pa r ame t r 1 c rank 
tests of scale (eg. Mood, 1 954 | S lege 1 -Tukey , 1960 | KIoce, 
1962)* The nonpa rame t r I c tests however are extremely 
sensitive to differences In the location parameter (Moses, 
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1963)# As diffarences in the population med . s \ncT&^^^ 
the procadures becoma increasingly conservati.v *d 
statistical power decreases. Adjusting for iifferencf^ in 
sample medians or sample means has bean sugp^ — : J \ ' it ase 
procedures do not provide valid tests of sgmI ^ sq ^ /tv when 
the distributions sampled are asymmeCric (Co^n.c vc t^ T pin n son 
and Johnson, 198M Olejnlk and Alglna^^ 1987>, 

Still another category of tests for varlince equality 
are procedures which transform the original data to a 
measure which reflects variability rather than location and 
compares the mean transformed scores using analysis of 
variance (eg. Miller, 1968| Brown and Forsythei 1974; 
O'Brien* 1978). fSmplrlcal studies of these approaches have 
had mixed results* Conoverj Johnson and Johnson (1981) 
compared 59 procedures (excluding 0*Brien*s) from the four 
categories of approaches and could only recommend the 
BroMn-Forsy the for non^normal d 1 s t r i bu c 1 on s • Additional 
studies have shown both the Brown-Fo rs the aud the O'Brien 
procedures as valid tests with reasonable statistical power 
(O'Brien, 1978; Olejnik and Algina, 1987). Neither 
procedure is uniformly superior to the other for all 
distribution forms however* O'Brien's procedure Is more 
powerful with normal and platykurtic distributions and the 
Br own=Fo r s t y he approach is more sensitive to variance 
differences when the rtistributlons a re leptokurtlc* 
Although both of these procedures provide valid tests for 
variance equality it would be convenient if a single 



procedura could h mm developed that could be used for all 
distribution foririi^s with equal statiscical power* 

A new approa^^h that might be suggested to teat for 
variance equallCy la an application of the boocatfap 
tne thodo logy • Thif&i resampling procedure es 1 1 ma c e s Elm e 
standard irror of a Btatlatlc of interest by amplrlcally 
developing the samnpling distribution of the statlsEtc 
through Honte CarLLo simulation procedures. BegiiiiilTx^ with a 
sample of sl^e n i^onits randomly selected from a poptt 1 a t J on , 
multlpla boDtstrap« estimates of the statlatlQ of Interest 
are calculated* TPt«ese estimates are based on subganipJLes of n 
Units ereatedby ^-^e sampling with rcplacemenc frowEh^ 
original nunlt^* The standard deviation of the 
distribution of bo *otstrap estimates provides a fneasu :re of 
accuracy of the sc atfstlc. Although It la not cloar how 
many bootstrap est Imatea are needed to provide a good 
estimate of icnura> ey ^ Efron and Tlbshlranl (19€63ha« 
suggested that as few r^s 25 subsamplea may be sufficient for 
some situations ati»d 200 subaamples a ho u Id be sufficient for 
moat estlmatora, *The approach has been used withsueceaa In 
several contexts (TXunneborg and Touslgnant, l 985jEfi^on and 
Tlbshlranl, 1986). Given that the standard eirprof the 
sample variance caftm be estimated accurately It c©iild then be 
used to calculate ^ran F or a chl^square test st^Ctitl^* Since 
the standard error ia determined empirically ivom sample 
data* the approach cfjuld be used for all population 
distribution farms 



The present study „ad two o. bjectl.es, First to apply th 
bootstrap .ethodolo,y to estt.-ate tha accuracy of th. ,a„pi 
variance and to conpar. thia a=.tlmate with tha theoretical 
m.asur. of accuracy derived by Box ( 1 953 ). Sacond. use the 
empirical estimate of the sa.ptie variance standard error to 
compute a test staUatlc -and develop the samplinR 
dlatrlbutlon of that test ,tacS«tlc to determine the Type I 
«rror rate under a m,e null c<>«ndltlon and the statistical 
power when pop u 1 a t lo„ va r lane . differ. Finally the Type I 
• rror rate and the statistical power of thla approach ie 
compared to s Im 1 la r r.su 1 ta ebt.alned using O.Brlen.s teat. 
Bootstrap estimate offlccuracy 

Method. The ada.uacy of tH« bootstrap methodology to 
astl.ate the variance of th« u„b«iased sample variance was 
studied under s e ve ral cond 1 c t o„s« . Three factors were 
manipulated, sample BUa (n). ,i .strlbutlon form and the 
number of bootstrap sub.ample. ( b). Four sample sl.es were 
considered from t h rae d Is t r l b u 1 1. o n a 1 forms. Samples of sl.e 
9, 15. 25 and .30 wero selected f , rom distributions which were 
normal (0,0). platykurtlc (0.^1) leptokurtlc (0,3.75). 

Since It is not claarhow many bootstrap samples are needed, 
three levels of this factor w«r, considered: 50. 100 and 
200. Thus this part of the studr^ considered 36 conditions 
in a completely crossed 4x3k3 factorial design. Each of 
these conditions «er a rapl i ca t ed 200 times to determine, the 
average bootstrap estl«te of var. lanca ( ) . the average 
estimate of va r i. b i 1 1 cy f o r the sample variance rVaF(s2), 



and t:he avirage mean square error of the lootstrap estlinate 
of variability CMSE), The mean square ^r^or wai ealcul^tad 
ag tHie average aquared difference bet^e^fl each replication 
sgtilmace of variability and the t h eo re t ICal meas u ra of 
variance provided by Bok ( 1953 )# The varienca of the 
unblssad aample astlmata of the popula t iPn variance le 
calcttjlated as the followlngt 

«^bef« o is tVxm squared population varl^nct; 

n is thtm sample mizB; 
dnd k is thm kurtnsls of the popul a t f c^n di s t r 1 bu t Ion * 

Data Geneg^^tlQn . The data for the ac'id|were genaratad 
and a^nalyzid using the Statistical Ana l>^g 1 e Sys tern 
(SAgj 1984) computing package. ObservatlOrt^ yere generated 
havin if a wean of 10 and variance equal to luslng RANNOR, 
the a^ornial rand oni number ganeratlng functisHn SAS- The 
Tion-^n ormal dl i t r* 1 b u t i on s ^ were generaced b y trans for nil ng the 
norma 1 random variables using a pnlynaitilal power procedure 
gugge seed by Fleishman ( 1978): W ^ [ ( d c ) K+b] K + a * Where x 
im th ^ normally distributed random vaftab^^^nd a, b, c» and 
d art conetants which modify the Rkew %nd T<yrtoslfl of the 
dletr dbution leaving the mean and varl^nc^ imc anged, 

procedure , For each condition s cud led i random sample 
of n ■observatlOT^ a were generated* Boo ts t j subsamples were 
than re a ted by resampling with roplacefn^ntfrom the 



8 



ofigl^^al Sample oP n obsefvptlons with each bootstap 
subsafcmple conalsting of n scons. The unbiased estimate of 
the s«mple variance of each bootstrap Mubeample was then 
calcumaued Z(X^ ) creating a sampling 

dlstrCibut ion of the sample v^^dance hawed on B estimates of 
the 5«mple variance* Finally the mean and variance of the 
samplL^ng dlstrlbutjon was coinpycad alor-ag with the estimated 
aquare-d error (the di f f erenc © liitween tehe astimated sample 
varlan ^ce and the thfeoretleal value for the variance of the 
sample variance). This p roc ed iira wa s r»plicatad 200 times 
and th m average mean variance (8 ) , the— average variance 
across the bootstrap samples <vir(S^)) and the mean squared 
error *<MSE) were computed* Irpllot ta-stlng the program It 
was no»ted that over the 200 replication s the average mean 
varlan«^e consistently underestimated th m population variance 
which ^^as equal to 1. The undares 1 1 ma t ion was greatest when 
the samiple size was small* Before gene rating the sampling 
distributions of the sample variances, ^he program was 
modified to include two addtttOfial meth^Dds of calculating 
Che var^lan-e of the hootatrap lubsamplaM* Method 2 divided 
the sunm of squared devlatlong around tti« subsample mean by 
n-2 rat^her than n--!, ( - ZC -U^/ (n-S ) ). Kethod 3 
caleulaited the sum of squared devlaciQns of the bootstrap 
subaamp le observations around th mean of the original n 
observa tions from which th% bo^tstrflp gimbsamplea were 
gener^t ed. The sum of squmre^ were thgi^ divided by n-l. 
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CS^I E (X^-M )^)/(p"l) )^ In p^-ilot testing the revised 
program the nu^berof re p I lea ^ t Ions per condition was 
Increased to 500^ The result s for 200 and 500 replication 
were not notlcably different ao the sampling dlatrltautlons 
were generated using 200 repl leatlone* 

R e s u 1 ^ 5 # Tablet presenc m the results using the three 

methods of calc.ulatliig the bo otstrap subsample variance, for 
sample sl^es of 9|i5, 25 and 30 using 50, 100 or 200 
bootstrap subsatni^ha when the population distribution 

sampled was notitiah tables 1 T and Ttl report similar 

results when the sampled dist^ ributlons were platykurtlc and 

leptokurtlc r^tsp%ctl\?ely* Fo r all four sample sizes and all 



Insert tables II and III hare 



three methods of cglnulatlng fcche subsample variance the 
results were sliritlKwhen the number of bootstrap subsamples 
were 50, 100 or 2 OOi Using me^^hod 1 for calculating the 
unbiased estltflat^ d samplG v^srlances the average variance 
(S ) conaistaritly uiide re s c 1 ma ed the population variance 
across all satflple d?,es and a^sross all sl^.es of bootstrap 
subsamples. The avorflge varlannce (varCS )) of the sample 
variances also undorost Ima ted the theoretical value of the 
sample variance. For c he plat k u r t f c d 1 s c r t b u 1 1 o n the 
difference was not^reat exce i=3 t for a sample sl^e of 9. The 
greatest dlff^renctwas found with the leptokurtlc 
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dlstrlbutton and a moderate rfifferei^ce was observed for the 
normal distribution, 

Using method 2 which U0&6 n-Z c— ^ther than n-l as the 
denominator for the nalculacaon of fehe bootstrap subsample 
variance^ the result.^ were ffi=3ieh cipher to the theoretical 
values* The lowest irtean vaflsLance e&^tlmate equalled #944 and 
the largeat mean variance egfeimate ^-quailed 1 ,075, The 
average variance deviated mo^t from the theoretleal value 
when the original saniple bad only 9 o bee rva 1 1 ons • As sample 
sl^e Increased thi average \/»riance approached the 
theoretical value for the Sanmplg var lance. 

Method 3 which used the origlna 1 sample mean In 
calculating the siibs^imple var lance a iso had results similar 
to the theoretical vilueg for the aa=Tiple variance and the 
mean variance* The sma 1 le t mean vanrlance was calculated as 
.965 and the largeitmean vtr lanca e«:iualled i*094. Only 
with a sample of si^e 9 did t liemaan variance deviate 
greatly froin th^ tliaofetlcal -^alue f^^r the variance, 

EKamlninl the tnean square irror^^ for all three methods 
Indicate consldQfable variabiUty bet^ween the eiitlmated 
variance of the fainpU varlart^aand ^^he theoretical value. 
The errors were graitist for »he 1 ep ^= oku r 1 1 c distribution 
and smallest for the pla tyku ffc Ic distribution. For the 
normal d 1 s t r Ibu t ion the errors appeair:red small when the 
sample size was at least 15* These r^esults Indicate that 
estimating tha standard efrQ^ of the sample variance may be 
adequate for hypotheBls testir^gonly when sample sizes are 



at least 15 and the population distribution Is normal or 
platykurtic. With a leptokurUic dtstrlbutlon the average 
variance for all three methods seriously und e re s t Ima t ed the 
theoretical value of variance. More Importantly the meiii 
square error was lar^e for the three methods of caldulaclnf 
subsample variance for all sample mizmv. How Important tfie^e 
differences are for hypothesis testing for variance equality 
is not clear* However one might expect that using the 
bootstrap methodology to estimate the standard error fora 
sample variance would result in a liberal test for varladce 
equality If the sampled distribution was leptokurtlc- With 
a platykurtic population the test may bp valid for even 
small sample si^^es. When the population distribution Is 
normal the bootstrap procedure mi^ht be valid when sample 
sizes are at least 15. The second part of the study 
Investigated the nccuracy of these predictions. 

Testing for variance equality 

Method . Two classical procedurns for comparing 
population parameters using sample statlstfcs were 
considered* The sample variances were compared using nn 
F^ratio and a chl-square test statistic. Although both 
procedures could be used to compare the variances of snvef/il 
populations, the present study was limited to a comparison 
of two populations* An F=ratlo can he computed as the ratio 
of the squared difference between two sample statlstins And 
the sum of the variances of the two statistics. In the cbhb 
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of the sample variance the F-ratio Is computed as the 
following 

F ^ ^ -1 -2 ^ 

Var(sJ) + VarfS2) 

The test statistic Is referred to an F distribution with 1 
and n^+n^^Z degrees of freedom, 

A chl-square teat statistic can he constructed by 
summing the ratio of the squared difference between the 
sample statistic and the mean sample statistic across all 
comparison groups and the variance of each sample statistic. 
In the case of the sample variance the chl-square statistic 
can be calculated as the following* 



Var(sJ) 

The test statistic is referreB to a chl^square distribution 
with 1 degree of freedom. 

For both statistics the variances were estimated using 
the bootstrap methodoLgy. Both of the procedures considered 
assume that the sampling distribution of the sample 
statistic is normal. Since the sampling distribution of a 
sample variance is positively skewed, it was necessary to 
transform the sample statistic before computing the test 
statistic. Two transformations were considered. The sample 
variance was transformed by taking the log of the sample 
statistic* This is the approach taken In Bartlett's test. 
The second transformation considered was recently suggested 
by Hawkins and Wlxley (1986) using the fourth root of the 
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chl-squared variable. It was hoped that an batter 
approxlnialilOTi to normality could improve the Type I error 
rate and statistical power of the tests* 

A third test for variance equal Ity was conslderad in 
the study which did not use the bootstrap methodology* 
O'Brien's (1978) procedure for eomparlng variances was 
included for two reasons: first since O'Brien's test is 
currently one of the best procedures for comparing 
population variances it provides a useful Index for 
comparing observed Type I error rates when the null 
hypothesis is true and second it provides a standard to 
evaluate the statistical power of the procedures using the 
bootstrap methodology when the populatlGn variances 
differed* O'Brien suggested that a test for variance 
equality could be developed by transfer rnlng the sample data 
u s 1 ng I 

2 

r ^ (w+n^2)n.(x.,-x .) --ws.Cn.-l) 



(n,-l)(n,-2) 
.1 3 

where Sj is the within ^roup unbiased estimate of variance 
for sample J and w is a weighting factor. O'Brien (1981) 
recommends setting w=.5 for most situations. The 
transformed observations are then used as the outcome 
measure In ealculatlng the ANOVA F^ratio. 

Da ta genera t ion . The data for this part of the study 
were generated using the Rannor random generating function 
In the SAS computing package* Two factors were manipulated 
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sample size and dLstribution form. Since estimates of 
sample variance variability in the first part of the study 
ware shown to be somewhat accurate when sample sizes were at 
least 15s three sample size combinations were Included in 
compariaons of variances: (15sl5), (25,25) and (30,30). 
Since neither O'Brien's test nor the bootstrap procedures 
are affected by sample sl^e Inequality under the null 
condition, only equal sample sizes were considered. Three 
distributional forms were studied* normal (0,0), platykurttc 
(Qj'-'l) and leptokurtic (0,3.75). The d I s t r 1 b u C 1 on a 1 forms 
were generated using the same methodology as described in 
part 1 of the Investigation. 

Finally, since the estimated stability of the sample 
variances In part 1 was not greatly affected by the number 
of bootstrap subsamples used, the estimated varian^^es of the 
sample variance was based on 100 bootstrap suhsamples. 

Procedure « For each condition studied a random sample 
of n observations were generated for each of two independent 
groups* The sample variances were then compared using 
0*Brlen'a test. Then For each group the variance of the 
sample variance was estimated separately using the bootstrap 
methodology. For each bootstrap sub sample the subsample 
variance was comp uteri using the tliree methods described 
earlier. Each of these estimates was transformed using the 
log transformation and the 4th root transformation before 
the variance of the sampling distribution was calculated. 
Before computing the F-ratio and ehl--square statistics, the 
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sample variances of the original n observations were also 
transformed. Thus for each condicion studied 13 test 
statistics were computed^ O'Brien's, the six teats using the 
F-ratlo (3 using the log transformation and 3 using tha 4th 
root transformation) and six rests using the chl^square 
statistic. This procedure was replicated 1000 times to 
develop the sampling distribution of the 13 test statistics 
for each of the 9 conditions under Investigation* The 
proportion of times the null hypothesis was reiected at the 
•Olj m05 and ,10 levels were recorded. 

Re suits . The empirically determined Type I error rates 
for the 13 test statistics are reported in tables IV* V and 
VI for the .01, ,05 and .10 levels of significance 
respectively. O'Brien's test statistic had observed Type I 



Insert Tables IV, V, and VI here 



error rates within two standard errors vE the nominal 
significance levels for all conditions studied except the 
condition where the sampled distribution was normal and 
sample size was 15 and when the distribution sampled was 
leptokurtlc and the sample sl^e was 25. For these 
conditions the observed Type I error rate underestimated the 
nominal significance levels at the .10 and .05 with normal 
distribution and at the .05 and .01 with the leptokurtlc 
d is C r 1 bu t Ion . These results are consistent with previous 



research findings and support the validity of the data 
generation procediireB used in the study. 

No clear patcern of results were obtained for the tests 
which used the bootstrap methodology- The results for the 
test statistics based on the bootstrap estimates of 
variability for the sample variance had mixed results 
depending on the distributional fornij the test staCistlc, 
the type of transformation used, the sample slae and the 
method of computing the subsample variance* In general the 
bootstrap test statistics had observered Type I error rates 
which overestimated the nominal significance level when the 
d i s t r tbu t ions were normal or leptokurtie* With the 
platykurtlc distribution the observed Type T error rates 
were within two standard errors of the nominal significance 
level for the tests using the F-ratlo eKcept for method 2 
with the 4th root transformation when the nominal 
significance level was underestimated* With the chi=-square 
test the results were mixed. Method I had appropriate error 
rates when sample st^es were at least 25 per group for both 
the log and 4th root transformations. Method 3 ovs res t ima ted 
the .10 and #05 nominal significance levels but 
underestimated the *01 nominal significance levels 

The three methods for computing subsample variance 
provided similar conclusions. When the log transformation 
was used method 1 and 2 had Identical results. With the 4th 
root transformation method 2 provided for an underestimation 
of the nominal flgnificance level when the F^ratio was 
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calculated. With the chi^square test and method 2 the 
nominal significance level was underestimated for the normal 
and platykurtlc d I s t u r bu 1 1 p n s and overestimated for the 
leptokurtle distribution. 

Finally the 4th root transformation did not Improve the 
approximations to the reference distributions for either 
test statistic. Using the 4th root transformation to 
normalise the sampling distribution of the sample variance 
resulted in findings similar to those reported under the log 
transforraatlon* 

Statistical Powe r, The above results Indicate that 
with the axecption of the platykurtic distributions the 
procedures considered In the present study using the 
bootstrap methodology do not provide valid tests for 
variance equality* Given these result attempts to estimate 
statistical power would not be meaningful and no further 
analyses were conducted* 

Summary and Discussion 

Developing a single procedure Tor comparing population 
variances which could be used for all distribution forms was 
the major objective oE the present study. It was hoped that 
by using bootstrap methodology the sampling variability of 
the sample variance could be estimated accurately and an 
F-test or chl-square test could be computed to compare 
estimates of population variances. The Initial results of 
the study Indicated that the unbiased estimator of sample 
variance computed on the bootstrap subsamples underestimated 
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tha population variance. But cwo mod i f ica 1 1 on s for conipuclng 
aubsample variance were both successful in nrovldlng more 
accurate estimates of the population variance. 

Estimating the variability of the sample variance was 
lass successful. The average variability over 200 
replications resultefl in an underestimation of the 
theoretical variance when the sampled distribution was 
normal or leptokurLic, With the platykurtic distribucion 
the estimated variance was similar to the theoretical value* 
These results were consistent across all three methods of 
calculating subsamp^e variance, although the unbiased 
estimator deviated the most from the theoretical value. As 
sample size Increased the bootstrap estimate of accuracy 
also Improved, The mean square errors however were large for 
the leptokurtlc distribution for all sample sl^es Indicating 
considerable variability from replication to repllcacion in 
the estimation of the variability of the sample variance 
with that distribution. The mean square errors for the 
normal and platykurtic distributions were not as ^reat and 
provided some hope that a valid test could be developed 
using the bootstrap methodology. By comparison, Efron (1983) 
had reported ipean square errors for the bootstrap estimate 
of accruacy for a correlation coefficient of the same 
magnitude as those reported here for the normal 
distribution* 

Since the bootstrap estimate of variability for the 
sample variance generally underestimated the theoretical 



value It was predicted that the F-test and chl-square tests 
would have Type I error rate.^ greater than the nominal 
significance level when the d la t r Ibn t Ion s sampled were 
normal or leptokurtic* The extent to which Che observed Type 
I error ratea overestimated the nominal significance level 
was unknown however* The predictions were correct* The 
extent of the overestlmation depended on the distribution 
form, the method of transformation and the test statistic 
usedp Observed Type I error rates were observed similar to 
the nominal significance level however when the distribution 
was platykurtica 

The reaulcs of this study indicate that in the case of 
the sample variance the bootstrap methodology can provide 
some indication of sample accruacy but the degree of 
accuracy depends on the distribution form. If the 
researcher has no Idea of the population distribution form 
then it would be impossible to interpret a bootstrap 
estimate. In addition except for the platykurtlc 
distribution the tests for statistical significance 
using the bootstrap estimate of variabnity resulted in a 
liberal test* Since O'Brien's test for variance equality 
does not overestlmace the Type I error rate for any 
distribution it appears that the bootstrap approach does not 
provide a useful alternative. 
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Table I 



Empirical estlniate^ of fnean, varlfince and mean sqtiare error for thre 
bootatrap estlniates of sample variance over 200 replications when th 
population distribution is normal. 
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Table Tl 



Empirical estlmatea or' 
bootstrap estimates of 



meanp variance and 



mean square error for three 
BnmplB variance over 200 replications when 
population distribution is platykurttc* 



thi 



Methcci 1 



F2 



Var (S ) MPK 



Method 2 



Va r ( S ) MSE 



9 50 


. 902 


.12 1 


.006 


1.016 


.12 1 


100 


^9 12 


,126 


, 00 7 


1 .026 


,125 


200 


• 900 


.129 


.007 


1,012 


.13 2 


The 0 r y 


1 .000 


. 139 




1 . 000 


.139 


15 50 


.953 


, 069 


. 00 j 


1.021 


. 068 


100 


, 928 


.070 


.00 1 


,995 


,07 1 


200 


*917 


.070 


. 00 1 


. 983 


.070 


The o r y 


1 ,000 


.07 3 




1 . 000 


.07 3 


25 50 


* 962 


. 040 


. ono 


1 .000 


,040 


100 


^ 969 


,042 


.000 


1.0 10 


,04 2 


200 


* 960 


,04 1 


. 00 0 


.999 


.041 


The o ry 


1 ,000 


,043 




1 .000 


.043 


30 50 


• 967 


,034 


. 000 


.999 


.034 


100 


.948 


,03 2 


.0 00 


,979 


,033 


200 


.859 


.034 


.0 00 


,993 


,034 


Theory 


1 . 000 


.036 




1 . 000 


.036 



. 008 
> 008 
-008 



.001 
.00 1 
. 00 1 



.000 
,000 
000 



,000 
.000 
. 000 



1.031 
1 . 040 
1 ,029 
1 .000 

1 .026 
. 999 
. 988 

1 .000 



. 000 
.010 
.000 
i 000 

.00 1 
. 982 
994 
000 



Method 3 



VarCS ) MSW 



. 159 
. 164 
.169 
. 139 

,079 
.081 
.081 
.073 

.044 
.04 6 
.044 
.043 

.036 
.035 
,0 37 
036 



0 1 1 

0 1 2 
012 



.012 
-001 
001 



.000 
. 000 
00 0 



pOOO 
. 00 0 
.000 



2^ 



EKLC 



Table ITT 



Empirical estimates oP rnenn* variance and mean square error for three 
bootstrap estimates of sample variance over 200 replications when the 
population distribution is l e p to ku r 1 1 c • 



M e t !i o d 1 Method 2 



Method 3 



Var(S^) MSR Va r ( S ^ ) mM 1^ VaTCS^) 



on 
4* o 
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Table IV 



Observed Type 1 error rates for 13 tests of variance equality with 
sample slzeii of 15, 25s and 30 when the nominal significance level 
is *0l . 



Procedure 



0 » Brien 



F-ra c lo 



Normal , Platykurtic LepCokurtlc 



Mf 
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"3 



4th root M 
"3 
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1 



M ^ bootstrap sub sample variance calculated using the unbiased 

estimator of subsample variances 
bootstrap subsample variance calculated by dividing the sum 

of squares by n-2 
bootstrap subsample variance calculated by subtracting sample 

mean rather than the siibs ample mean in computing the Bum of 

s q ua re s . 
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Table V 



Observed Type I error races for 13 tests of variance equality with 
sample sizes of 15^ 25 and 3 0 when the nominal significance level 
is *05* 
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bootstrap sub samp la variance calculated using the unbiased 

estimator of sub sample variance, 
bootstrap sub sample variance calculated by dividing the sum 

of squares by n-2, 
bootstrap subsample variance calculated by subtracting sample 

mean rather than subsample mean in computing the sum of 

squares , 
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Table VI 



Observed Type 1 arror rates for 13 tests of variance equality with 
sainple sizes of 15 ^ 25 and 30 when the nominal significance level 
is .10* 



Normal Platykurtic Leptokurtic 



Procedure 15 25 30 15 25 30 15 25 30 
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. 124 
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.226 


.244 



bootstrap subsample variance calculated using the unbiased 

estimator of subsample varianee, 

bootstrap subsample variance calculated by dividing the sum 

of squares by n^2 , 

bootstrap subsample variance calculate by subtracting sample 

mean rather than subsample mean In CQmputlng the sum of 

squares . 
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